-1- 



DocketNo. SVL920030045US1 
Firm No. 0057.0008 



METHOD, SYSTEM, AND PROGRAM FOR 
PREDICATE PROCESSING BY ITERATOR FUNCTIONS 



BACKGROUND OF THE INVENTION 
5 1. Field of the Invention 

[0001] The present invention is related to predicate processing by iterator functions. 



2. Description of the Related Art 

[0002] A database system is one in which data is stored, retrieved, and processed. Data 
10 records in a relational database management system (RDBMS) in a computer are 

maintained in tables, which are a collection of rows of data all having the same columns. 
Each column maintains information on a particular type of data for the data records 
which comprise the rows of data. Tables in the database are searched using, for example, 
a Structured Query Language (SQL), which specifies search operations or predicates to 
15 perform on columns of tables in the database to qualify rows of data in the database 
tables that satisfy the search conditions. A predicate is an element of a search condition 
that expresses or implies a comparison operation (e.g., customer number > 100 or 
departmentname = "finance"). 

[0003] Relational DataBase Management System (RDBMS) software using a SQL 
20 interface is well known in the art. The SQL interface has evolved into a standard 
language for RDBMS software and has been adopted as such by both the American 
National Standards Institute (ANSI) and the International Standards Organization (ISO). 
[0004] One type of function that may be included in a SQL statement is an iterator 
function. Iterator functions may also referred to as "table" functions in the DB2® UDB® 
25 system (e.g., for OS/390®, Linux®, UNIX®, and Windows® operating systems) 

available from International Business Machines Corporation. These iterator functions 
may be written in, for example, C, C++, JAVA®, or SQL. An iterator function receives 
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a set of arguments and returns a table to the SQL statement that invokes the function. A 
table function may be invoked in the FROM clause of a SELECT statement. 
[0005] An iterator function may be used to retrieve/extract data from an external source 
(e.g., a text search engine or a spread sheet) into a database. The result set (i.e., rows of 
5 data) returned by an iterator function may be treated as a table and may be referred to as a 
"virtual table". When an iterator function is used in the FROM clause of a SELECT 
statement, columns in rows of data returned by the iterator function are mapped to a 
virtual table, and the columns of the virtual table may be used in predicate expressions 
and may be joined with columns of other tables. 
10 [0006] The following sample SQL statement (1) includes a transpose("IBM") iterator 
function: 

Statement (1) 

SELECT vcoll.high, vcoll.low, vcoll. timestamp 

FROM TABLE(FUNCTION transpose("IBM")) vtabl(vcoll), auditjable aut 
15 WHERE vcoll.time_stamp>'1970-01-01 00:00:00.00000* AND 
vcoll.time_stamp <'1971-01-01 00:00:00.00000' AND 
vcoll.time_stamp = aut.time_stamp; 

[0007] The transpose("IBM") iterator function operates on timeseries data that stores 
20 historical data (e.g., of stock symbols) as binary large objects (BLOBs) and returns a set 
of rows that are stored in result table "vtabl" with column "vcoll". The column "vcoll" 
is of a row datatype. A row datatype may have multiple fields, and, in this example, one 
of the fields is a timestamp field represented by "t". Therefore, vcoll. timestamp is used 
to access a timestamp field in a row of data. The predicates on vcoll. timestamp (i.e., 
25 vcoll. time_stamp>' 1970-0 1-01 00:00:00.00000' AND vcoll. time jstamp <'1971-01-01 
00:00:00.00000') are used to filter the rows of data with the specified time range and are 
also used in the join filter (vcoll. time stamp = aut.time stamp). 
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[0008] In Statement (1), the qualification (i.e., the predicates) on the table returned by the 
transpose("IBM") iterator function is: 

vcoll.time_stamp> , 1970-01-01 00:00:00.00000' AND 
5 vcoll.time_stamp <'1971-01-01 00:00:00.00000' AND 

vcoll.timestamp = aut.time_stamp; 

[0009] Typically, the iterator function creates a virtual table with a result set. Then, the 
qualification is applied to the virtual table to filter rows of data in the virtual table. In 
10 many cases, only a small percentage of rows of data in the virtual table remain after the 
qualification is applied. Unfortunately, because the qualification is applied after the rows 
of data are retrieved for the result set, many rows of data are unnecessarily retrieved for 
the virtual table. 

[0010] The following sample SQL statement (2) includes a text search iterator function: 
15 Statement (2) 

SELECT a.isbn, a.year, b.score 
FROM articles a, 

text_search('articles ! , 'abstract 1 , "Bush" in same sentence as "recession") b 
WHERE a.year >= 2000 AND a.isbn = b.key AND b.score >= 0.9 
20 ORDER BY 3 

[0011] The text_search() iterator function returns a key and a score for articles that have 
"Bush" in the same sentence as "recession" from an external data source "b". The 
application program submitting SQL statement (2) is interested in articles with a score 
25 (assigned by the text_search() iterator function) greater than or equal to 0.9. Because the 
text_search() iterator function is not aware of the score qualification, the text_search() 
iterator function returns a virtual table with all articles that have "Bush" in the same 
sentence as "recession." Not having knowledge about the qualification and filtering rows 



Docket No. SVL920030045US1 
Firm No. 0057.0008 



of data after the virtual table is created by the textsearch iterator function is an overhead 
on query performance and system resources. Note that the database engine would 
process the a.year >= 2000 AND a.isbn = b.key predicates as the iterator function does 
not have access to table "a". Alternatively, a new text_search() function may be written 
5 with a new parameter for passing in the score to the text_search() function. However, if 
the overloading of the function is not allowed, this leads to a situation in which a user 
chooses the right name of the parameter to be passed in, requiring new development and 
release of software whenever there is a need to add a new predicate to the search criteria. 

10 [0012] The DB2® Text Extender product (available from International Business 

Machines Corporation) has a hard-coded solution to this problem. For more information 
on the DB2® Text Extender, see "DB2® Optimization in Support of Full Text Search", 
by Albert Maier and David Simmen, Bulletin of the Technical Committee on Data 
Engineering, Volume 24, No. 4, IEEE Computer Society, December 2001. 

15 [0013] Thus, there is a need in the art for a generalized solution to allow predicate 
processing by iterator functions. 

SUMMARY OF THE INVENTION 
[0014] Provided arc a method, system, and program for processing predicates in an 

20 iterator function. When an iterator function included in a statement is invoked, one or 
more predicates included in the statement are obtained. The one or more predicates are 
applied to a row of data. If applying the one or more predicates results in a match, the 
row of data is returned. If applying the one or more predicates does not result in a 
match, a search is performed for another row of data for which application of the one or 

25 more predicates results in a match. 

[0015] Also disclosed is a method, system, and program for processing predicates in 
which a statement including an iterator function and one or more predicates is received. 
A qualification descriptor that describes the one or more predicates and one or more 
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functions that are to be used to evaluate the one or more predicates is created. Then, 
iterator function is invoked one or more times, until receiving a done indicator from 
iterator function. 



5 BRIEF DESCRTPTTON OF TH E DRAWINGS 

Referring now to the drawings in which like reference numbers represent 
corresponding parts throughout: 

FIG. 1 illustrates, in a block diagram, a computing environment in accordance 
with certain implementations of the invention. 
10 FIG. 2 illustrates logic implemented in a data store engine in accordance with 

certain implementations of the invention. 

FIGs. 3A, 3B, 3C, and 3D illustrate qualification descriptors in accordance with 
certain implementations of the invention. 

FIGs. 4A and 4B illustrate logic implemented in an iterator function processor in 
1 5 accordance with certain implementations of the invention. 

FIG. 5 illustrates an architecture of a computer system that may be used in 
accordance with certain implementations of the invention. 



DFTATT ED DESCRIPTION 
20 [0016] In the following description, reference is made to the accompanying drawings 
which form a part hereof and which illustrate several implementations of the present 
invention. It is understood that other implementations may be utilized and structural and 
operational changes may be made without departing from the scope of the present 
invention. 

25 [0017] FIG. 1 illustrates, in a block diagram, a computing environment in accordance 
with certain implementations of the invention. A client computer 100 is connected via a 
network 190 to a server computer 120. The client computer 100 may comprise any 
computing device known in the art, such as a server, mainframe, workstation, personal 
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computer, hand held computer, laptop telephony device, network appliance, etc. The 
network 190 may comprise any type of network, such as, for example, a Storage Area 
Network (SAN), a Source Area Network (LAN), Wide Area Network (WAN), the 
Internet, an Intranet, the World Wide Web (WWW or "Web") etc. The client computer 
5 100 includes system memory 104, which may be implemented in volatile and/or 
non-volatile devices. One or more client applications 1 10 may execute in the system 
memory 104. The client applications 1 10 may submit queries (e.g., SQL queries) to the 
server computer 120 for execution. The queries may include iterator functions. 
[0018] The server computer 120 includes system memory 122, which may be 
10 implemented in volatile and/or non-volatile devices. A data store engine 130 executes in 
the system memory 122. Programs running in the data store engine 130 (e.g., User 
Defined Functions (UDFs)) may also issue queries (e.g., SQL queries) to be processed by 
the data store engine 130, and these queries may include iterator functions (e.g., in the 
FROM clauses of SELECT statements). A iterator function processor 132 executes as 
15 part of the data store engine 130 and may invoke external applications on the network 
190 (e.g., on the World Wide Web), such as search engines or Web services. The 
iterator functions may obtain data from different locations, such as local memory, local 
disk, or storage sources connected to the network 190. Additionally, one or more server 
applications 140 execute in system memory 122. 
20 [0019] The server computer 120 provides the client computer 100 with access to data in a 
data store 170 (e.g., a database). Although a single data store 170 is illustrated for ease 
of understanding, tables 150 and other data in data store 170 may be stored in data stores 
at other computers connected to server computer 120. Also, an operator console 180 
executes one or more applications 182 and is used to access the server computer 120 and 

25 the data store 170. 

[0020] The data store 170 may comprise an array of storage devices, such as Direct 
Access Storage Devices (DASDs), Just a Bunch of Disks (JBOD), Redundant Array of 
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Independent Disks (RAID), virtualization device, etc. The data store 170 includes tables 
150 that are used with implementations of the invention. 

[0021] FIG. 2 illustrates logic implemented in a data store engine 130 in accordance with 
certain implementations of the invention. Control begins at block 200 with the data store 

5 engine 130 receiving a statement (e.g., a SQL statement) including an iterator function. 
Although examples herein may refer to SQL statements, implementations of the 
invention are applicable to statements in other languages (e.g., X-Query). The statement 
may be received from a client application 1 10 or an application 182. 
[0022] In block 210, the data store engine 130 creates a qualification descriptor. A 

10 qualification descriptor may be described as an internal representation of a qualification. 
The term "qualification" refers to predicates. Also, the term "qualifiers" may be used to 
refer to the predicates. In certain implementations of the invention, a qualification 
descriptor is a dynamically allocated structure that describes one or more predicates in a 
query and provides information to evaluate the predicates. In particular, the 

15 qualification descriptor describes the predicates in the statement and provides a handle to 
each function that is used to evaluate one of the predicates. In certain implementations, 
each predicate is evaluated by a separate function. 

[0023] The following Example (1) provides a SELECT statement, in which the predicate 
to be evaluated by the iterator function processor 132 is vcol 1 .time_stamp > '1994-01-05 
20 00:00:00.0000'. 

Example (1) 

SELECT vcoll.time_stamp, vcol 1. high 

FROM table (function transpose('TBM")) vtabl(vcoll) 

WHERE vcoll.time_stamp > '1994-01-05 00:00:00.0000'; 

25 

[0024] Although examples herein may refer to a transpose() iterator function, 
implementations of the invention are applicable to any iterator function, including, for 
example, the following: text_search(), web_service(), or spatial searchQ. For example, 
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for the web_service() iterator function, when SQL is used to invoke a web service, the 
predicates may be pushed across the web to minimize dataflow. Spatial(l) provides an 
example of a spatial_search() iterator function: 

Spatial(l) 

5 SELECT sptab.location, sp_distance(my_tab.myloc,sp_tab.location) 
FROM spatial_search('location', "nearest pharmacy") sp Jab 
WHERE sp_distance(my_tab.myloc, spjab.location) < 2.0 and 
sp_tab.pharmacyj*ating = 1; 

10 [0025] FIG. 3 A illustrates a sample qualification descriptor 300 for Example (1) in 
accordance with certain implementations of the invention. A qualification descriptor 
300 includes predicates 302, number of predicates 304, column number in the predicate 
306, qualification descriptors 308, type of value 310, functions to execute 312, and a 
User Defined Function (UDF) expression of the predicate 314. Predicates 302 specifies 

1 5 one or more predicates, and, for Example ( 1 ), specifies vcol 1 . time stamp > ' 1 994-0 1 -05 
00:00:00.0000' A number of predicates 304 specifies the number of predicates to be 
applied by the iterator function processor 1 32, and, for Example ( 1), is one. The column 
number in the predicate 306 identifies the column of the predicate for which a value is 
provided, and, for Example (1), is one. The qualification descriptors 308 provide 

20 pointers to additional qualification descriptors for each of multiple predicates, and for 
Example (1), since there is one predicate, additional qualification descriptors 308 are not 
available. The type of value 3 10, for Example (1), specifies a constant. The functions to 
execute 312 provide a handle for each function to be executed, and, for Example (1), 
provides a handle to the greaterthan function for Example (1). A UDF expression 314 

25 provides a graph-like representation that is used to process predicates, and, for Example 
(1), the UDF expression 314 is illustrated by 316. 
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[0026] The following Example (2) provides a SELECT statement, in which the predicates 
to be evaluated by the iterator function processor 132 are vcoll.time_stamp < 
'1994-01-05 00:00:00.0000' and vcoll. high < 20.00. 

Example (2) 

5 SELECT vcol 1 . time_stamp, vcol 1 .high 

FROM table (function transpose("IBM")) vtabl (vcoll) 

WHERE vcoll.time_stamp < '1994-01-05 00:00:00.0000' and vcoll. high < 20.00; 

[0027] FIG. 3B illustrates a sample qualification descriptor 320 for Example (2) in 
10 accordance with certain implementations of the invention. In qualification descriptor 
320, the number of predicates is two, and the column number in the predicate is not 
available as this information is in the qualification for simple predicates. Also, the 
qualification descriptors include two pointers, one for each predicate, and each pointer 
points to a qualification descriptor for that predicate. In the UDF expression for the 
15 predicate, the qualification for AND is the root. 

[0028] The following Example (3) provides a SELECT statement, in which the predicates 
to be evaluated by the iterator function processor 132 are vcoll.time_stamp < 
'1994-01-05 00:00:00.0000' and vcoll.high < 20.00. 

Example (3) 

20 SELECT vcol 1 .t, vcol 1 .high 

FROM table (function transpose("IBM")) vtabl (vcoll) 
WHERE vcoll.t < '1994-01-05 00:00:00.0000' and 

vcoll.high < 20.00 and vcoll.high = audit_table.high; 

25 [0029] FIG. 3C illustrates a sample qualification descriptor 340 for Example (3) in 
accordance with certain implementations of the invention. In qualification descriptor 
340, the number of predicates is two because, although Example (3) has three predicates, 
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the iterator function processor 132 processes two of the three predicates. In particular, 
the qualification does not include the join predicate for vcoll.high = audit_table.high. 
[0030] Example (4) provides a SELECT statement, in which the predicates to be 
evaluated by the iterator function processor 132 are vcoll. timestamp < '1994-01-05 
5 00:00:00.0000' and vcoll. high < 20.00. 

Example (4) 

SELECT vcoll. t, vcoll.high 

FROM table (function transpose("IBM")) vtabl (vcoll) 
WHERE vcoll.t < '1994-01-05 00:00:00.0000' or 
10 vcoll.high = 15.00 

[0031] FIG. 3D illustrates a sample qualification descriptor 360 for Example (4) in 
accordance with certain implementations of the invention. In the UDF expression for the 
predicate, the qualification for OR is the root. 

15 [0032] In block 220, the data store engine 130 invokes the iterator function. The logic 
for processing the iterator function is represented by the iterator function processor 132. 
The data store engine 130 may also perform other processing prior to invoking the 
iterator function. For example, the data store engine 130 performs parsing and 
optimization before executing the statement including the iterator function. In block 230, 

20 the data store engine 130 determines whether the iterator function processor 132 returned 
a row of data or a done indicator. A done indicator is returned by the iterator function 
processor 132 after the iterator function processor 132 has processed all rows of data it 
has to process. If the iterator function processor 132 returned a row of data, then 
processing continues to block 240, otherwise, processing continues to block 250. 

25 [0033] In block 240, the data store engine 130 stores the row of data received from the 
iterator function processor 132 for which simple and/or complex predicates have already 
been applied and processing continues to block 250. The predicates passed to and 
evaluated by the iterator function may be a combination of simple predicates, and the 
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combination may be referred to as a complex predicate. Simple predicates are ones 
which compare a column value to a constant. In particular, in certain implementations, 
the iterator function applies simple and/or complex predicates to rows of data and returns 
a row of data for which application of the predicates results in a match to the 
5 qualification. That is, in certain implementations, the iterator function processor 132 
applies predicates involving constants against a column in the result set that the iterator 
function processor 132 returns. 

[0034] In block 250, the data store engine 130 applies predicates in the statement not 
already applied by the iterator function processor 132. These predicates may include 

10 simple and/or complex predicates not applied by the iterator function processor 132 (e.g., 
due to lack of resources), join predicates, and predicates on columns not in the result set 
of the iterator function. That is, in certain implementations, the data store engine 130 and 
the iterator function processor 132 may each apply some of the simple predicates (i.e., the 
iterator function processor 132 does not apply all of the simple predicates in these 

15 implementations). In block 260, the data store engine 130 determines whether the 

iterator function is to be invoked again. In certain implementations, the iterator function 
is invoked again if the iterator function did not return a done indicator to indicate that all 
rows have been processed. If so, processing loops back to block 220, otherwise, 
processing continues to block 270. In block 270, the data store engine returns the result 

20 of processing the statement (e.g., returns a table or a set of rows of data) to the client 
application 1 10 or application 182 that submitted the statement. 
[0035] FIGs. 4A and 4B illustrate logic implemented in an iterator function processor 
132 in accordance with certain implementations of the invention. Control begins at block 
400 with the iterator function processor 132 performing initialization processing if the 

25 initialization is to be performed. In particular, when the data store engine 130 invokes 
the iterator function, the data store engine 130 calls an Application Programming 
Interface (API) function to get the qualification descriptor that describes the predicates 
and the functions to evaluate each predicate. Other initialization processing, such as 
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initializing data structures (e.g., predicate evaluators) may also be performed. In block 
402, the iterator function processor 132 retrieves the next row of data, starting with the 
first. 

[0036] In block 404, the iterator function processor 132 applies one or more of the 

5 obtained predicates to the retrieved row of data. That is, for each row of data that is 
generated by the iterator function processor 132, the appropriate function specified in the 
qualification descriptor is applied on the corresponding predicate column. For each 
predicate (e.g., for a built-in datatype or a user-defined datatype), implementations of the 
invention will provide a function (e.g., a User Defined Function (UDF) or a system 

1 0 defined function) to evaluate that predicate on a column. In certain implementations, 
each simple predicate is a Boolean expression. If the predicate is an integer comparison 
(e.g., equal), a handle to the function that performs the integer equal comparison and 
returns a Boolean value is provided in the qualification descriptor. For each row of data, 
the iterator function processor 132 calls one or more functions on the data referenced by 

1 5 a predicate, and each of the functions processes the predicate. In certain 

implementations, one function corresponds to one type of predicate (e.g., an equal 
comparison). In certain alternative implementations, one function may process more than 
one type of predicate. The results from the functions may be combined using a predicate 
expression (e.g., combining AND and OR connectors in the statement). 

20 [0037] In certain implementations of the invention, an IBM® Informix® Virtual-Table 
Interface provides a mechanism to represent and process predicates. The mechanism 
may be thought of as a set of interfaces to access the predicates and access the type of 
each predicate and to combine the result of each predicate. This mechanism may also be 
used to push down other query processing information, such as the requested sort order 

25 for the result so that the iterator function may process this information as well. The 
IBM® Informix® Virtual-Table Interface is further described in the IBM® Informix® 
Virtual-Table Interface Programmers Guide, Version 9.1, March 2003, which is 
available from International Business Machines Corporation. 
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[0038J In block 406, the iterator function processor 132 determines whether the 
predicates were applied successfully. For example, the iterator function processor 132 is 
able to apply simple or complex predicates, but not join predicates. Also, the iterator 
function processor 132 may be unable to process predicates for other reasons, such as a 

5 lack of resources or due to unexpected errors. In certain alternative implementations, the 
iterator function processor 132 may also determine whether some predicates should be 
applied by the data store engine 130 (e.g., because the data store engine 130 has more 
processing resources). In certain implementations, the iterator function processor 132 
does not process the predicates if any one of the predicates is to be applied by the data 

10 store engine 1 30. In this case, from block 406, if the iterator function processor 132 is 
able to process all of the predicates, then processing continues to block 408, otherwise, 
processing continues to block 410. 

[0039] In certain alternative implementations, the iterator function processor 132 applies 
those predicates that may be applied by the iterator function processor 132 and sends 
15 indicators to the data store engine 130 to apply the remaining predicates. In this case, 
from block 406, if at least one of the predicates may be processed by the iterator function 
processor 132, then processing continues to block 408, otherwise, processing continues to 
block 410. 

[0040] In block 408, the iterator function processor 132 sets one or more indicators to 
20 indicate which predicates were applied by the iterator function processor 132, and 
processing continues to block 414. 

[0041] In block 410, the iterator function processor 132 sets one or more indicators to 
indicate that predicates can not be successfully applied by the iterator function processor 
132 and processing continues to block 412. In block 412, the iterator function processor 
25 132 returns the retrieved row to the data store engine 130. 

[0042] By setting the one or more indicators, the iterator function processor 132 is able to 
notify the data store engine 130 which predicates the iterator function processor 132 has 
applied, so that the data store engine 130 can save resources and avoid applying these 
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applied predicates. Also, the data store engine 130 is able to apply those predicates that 
the iterator function processor 132 did not apply so that the statement initially received 
that includes an iterator function is evaluated correctly. 

[0043J In block 414, the iterator function processor 132 determines whether there is a 
5 match. In certain implementations of the invention, there is a "match" if the row of data 
qualifies for selection based on application of one or more predicates (i.e., all of the 
predicates for that iterator function). If there is a match, processing continues to block 
416, otherwise, processing continues to block 418. In block 416, the iterator function 
processor 132 returns the row of data to the data store engine 130. That is, the iterator 
10 function processor 132 returns a row of data if the predicate evaluation returns TRUE 
(i.e., each of the functions returned in the qualification descriptor return TRUE). 
[0044] If there is no match, in block 418, the iterator function processor 132 discards the 
row of data. In block 420, the iterator function processor 132 determines whether all 
rows of data have been processed. If so, processing continues to block 422, otherwise, 
15 processing loops back to block 402 to retrieve another row of data. That is, if there was 
not a match, the iterator function processor 132 continues searching for a next qualifying 
row of data 

[0045] Thus, to minimize data flow from the iterator function to the data store engine 
130, implementations of the invention make predicates for a given query available to the 
20 iterator function processor 132. The following sample SQL statement (3) includes a 
transpose('TBM") iterator function. 

Statement (3) 

SELECT vcoll.high, vcoll.low, vcoll.time_stamp 

FROM TABLE(FUNCTION transposeO'IBM")) vtabl(vcoll), auditjable aut 
25 WHERE vcoll.time_stamp> , 1970-01-01 00:00:00.00000' AND 
vcoll.time_stamp <'1971-01-01 00:00:00.00000 f AND 
vcoll.time_stamp = aut.time_stamp AND 
aut.name= , TBM"; 
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[0046] The result set from the transpose("IBM") iterator function is mapped to column 
"vcol 1 " of table " vtab 1 1 '. In this example, since the transposeO iterator function returns a 
row of data (i.e., data of the row datatype), column "vcoll" is of row datatype. In 
general, an iterator function may return data of any datatype (e.g., system defined or user 
5 defined datatypes), however, an iterator function in a statement returns data of one 

datatype, as specified in the statement. In this example, column "vcoll" has time (t), high 
(high), low(low) fields. 

[0047] The iterator function processor 132 calls an API function to get the qualification 
descriptor that describes the predicates and the functions to evaluate each predicate. 
1 0 [0048] For each row of data that is generated by the iterator function processor 1 32, the 
appropriate function specified in the qualification descriptor is applied on the 
corresponding predicate column. 

[0049] The qualification descriptor obtained by the transposeQ iterator function in 
statement (3) includes the following predicates on the result columns: 

15 

vcoll.time_stamp> , 1970-01-01 00:00:00.00000' AND 
vcoll. time_stamp <'1971-01-01 00:00:00.00000' AND 
vcoll.time_stamp = aut.timestamp 

20 [0050] The qualification descriptor obtained by the transpose() iterator function in 
statement (3) does not include the predicate aut.name = "IBM" because the column 
"aut.name" is not in the result set of the transpose() iterator function. The iterator 
function processor 132 applies the predicates in the qualification descriptor, and the data 
store engine 130 applies the predicate aut.name = "IBM". Similarly, the data store 

25 engine 130 would apply any join predicates. 

[0051] Implementations of the invention provide a new function, which for ease of 
reference will be referred to as a get_qualification_descriptor function. In certain 
implementations of the invention, the get_qualification_descriptor function is an IBM® 
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Informix® Virtual-Table Interface API function that is used to obtain a qualification 
descriptor when an iterator function is used in the FROM clause of a SELECT statement 
and may have the format mMter_qual_get(). 

[0052] The qualification descriptor may be used with an existing IBM® Informix® 
5 Virtual-Table Interface API function that is available for the IBM® Informix® Dynamic 
Server 9.x version (where x in 9.x represents a positive integer value). 
[0053] The following is example psuedocode for a transpose() iterator function: 

/*ts_transpose_sql() is an example transpose iterator function.*/ 
10 void* 

tsJranspose_sql(ts_timeseries * ts, MIJFPARAM*fparam) 
/* "timeseries" refers to a datatype, and ts stores timeseries data; 
fparam stores function parameters, such as the state of a function.*/ 

{ 

15 /*mi_fp_request() indicates whether this is a first call from the data store engine.*/ 

if (mi_fp_request(fParam) = = SETJNIT) { 

/♦if mi_fp_request(fParam) is set to initialization, then the following 

processing is performed.*/ 

/*mi_zalloc() allocates space for a structure.*/ 
20 state = (struct ts_state *) mi_zalloc(sizeof(struct ts_state)); 

/*Now, get the start and stop flags.*/ 

/*mi_iter_qual_get() obtains predicates on the query by obtaining the 
qualification descriptor and store in qd.*/ 
25 qd=mi_iter_qual_get() ; 

if(qd!=NULL) 

/* If the qualification is not NULL, the following processing is 
performed.*/ 
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/*ts_get_start_and_stop() gets the start and stop flags.*/ 
ts_get_start_and_stop(state, qd, &state->flags); 

/*timeseries_open() opens the timeseries data and 
5 begins a scan at a given time.*/ 

state->tsdesc=timeseries_open(ts); 

/*Timeseries_scan() uses the predicates to set 
the window of rows of data to return. */ 
10 state->scan = timeseries_scan(state->tsdesc, state->start, state->stop); 

return (0); 



/* Get the saved state.*/ 

state = (structjs state *) miJpfuncstate(fParam); 
1 5 scan = state->scan; 
tsdesc = state->tsdesc; 



if (MI_FP_REQUEST(fParam) = - SETEND) 
return (0); 

20 

/* get_next_timeseries_elem() returns next row of data that qualifies when an additional 
row of data exists and returns TS_SCANJEOS when stop date is reached. */ 
ret = get_next_timeseries_elem(scan, &element); 

25 switch(ret) { 

case TS_SCAN_EOS: /* End of rows of data.*/ 

MI_FP_SETISDONE(fParam, MI TRUE); /*Set done indicator to true.*/ 
case TS_SCAN_NULL: /* Column value is NULL, which is a valid value. */ 



c 
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return (0); 

case TS_SCAN_ELEM: /* A qualifying row of data was retrieved. */ 
return(element); /*A qualifying row of data is returned. */ 

} 

5 } 

/*qd includes the predicates on the statement invoking the iterator function. */ 
/* ts_get_start_and_stop() finds start and stop date times in qualification. */ 

10 ts_get_start_and_stop(struct ts_state *state, MI_AM_QUALJDESC*qd, 

mMnteger* flags) 

{ 

if (mi quaMssimple(qd)) { 
if (qual column != 0) 
15 /*If the qual column is not the timestamp column, return. */ 

return; 

/* This is a simple qualification on the timestamp column and the operator 
needs to be found. */ 

20 

/*mi_qual_funcname() gets the name of the function that will be used to 

evaluate the predicate./* 

funcname = mi_qual_funcname(qd); 

25 /*mi_qual_constant() = gets the value of a constant for evaluating the 

predicate. */ 

stamp = mi_qual_constant(qd); 
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if (mi_qual_negate(qd) { 

/*If the qualification is negated, the following is performed.*/ 
if(funcname = = "greaterthanorequal" || 
funcname = = "greaterthan") 
5 state->stop = stamp; 

else if(funcname = = "lessthanorequal" || funcname = = "lessthan") 
state->start = stamp; 

else if(funcname = = "notequal" { 
1 0 state->start = stamp; 

state->stop = stamp; 

} else{ 

/* If the qualification is not negated, the following is performed.*/ 
if(funcname = = "greaterthanorequal" || 
15 funcname = = "greaterthan") 

state->start = stamp; 
else if(funcname = = "lessthanorequal" || 

fimcname = = "lessthan") 

state->stop = stamp; 
20 else if(funcname = = "equal" { 

state->start = stamp; 

state->stop = stamp; 

} 

} 

25 return 
} else { 

/*If there compound qualification, recurse to find each simple qualification.*/ 
pred_cnt = mi_qual_nquals(qd); 
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/*mi_qual_nquals() returns the number of predicates in the compound 

qualification./* 

for (i=0; i<pred_cnt; i++) 

ts_get_start_and_stop(state, mi_qual_qual(qd,i), flags); 

5 } 
} 



[0054] Thus, implementations of the invention enable a table function to access a 
qualification and return only rows of data that satisfy the qualification. In many cases, 

10 this reduces the number of rows of data in the result set, which leads to fewer retrievals 
of data and which creates a table that uses less memory. For example, in prior systems in 
which the data store engine 130 applied the qualification, 10,000 rows of data may be 
returned by the iterator function. With implementations of the invention, because the 
iterator function (rather than the data store engine 130) applies the qualification, 10 rows 

1 5 of data may be returned for storage in a virtual table. Additionally, because the iterator 
function processor 132 returns a next qualifying row, the data store engine 130 calls the 
iterator function processor 132 fewer times to retrieve rows of data. 
[0055] IBM, DB2, OS/390, UDB, and Informix are registered trademarks or common law 
marks of International Business Machines Corporation in the United States and/or other 

20 countries. JAVA is a registered trademark or common law mark of Sun Microsystems in 
the United States and/or other countries. Windows is a registered trademark of Microsoft 
Corporation in the United States and/or other countries. Linux is a registered trademark 
of Linus Torvalds in the United States and/or other countries. UNIX is a registered 
trademark or common law mark of The Open Group in the United States and/or other 

25 countries. 
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Additional Implementation Details 
[0056] The described techniques for predicate processing by iterator functions may be 
implemented as a method, apparatus or article of manufacture using standard 
programming and/or engineering techniques to produce software, firmware, hardware, or 
5 any combination thereof. The term "article of manufacture" as used herein refers to code 
or logic implemented in hardware logic (e.g., an integrated circuit chip, Programmable 
Gate Array (PGA), Application Specific Integrated Circuit (ASIC), etc.) or a computer 
readable medium, such as magnetic storage medium (e.g., hard disk drives, floppy disks,, 
tape, etc.), optical storage (CD-ROMs, optical disks, etc.), volatile and non-volatile 

10 memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, firmware, 
programmable logic, etc.). Code in the computer readable medium is accessed and 
executed by a processor. The code in which certain implementations are implemented 
may further be accessible through a transmission media or from a file server over a 
network. In such cases, the article of manufacture in which the code is implemented may 

15 comprise a transmission media, such as a network transmission line, wireless 

transmission media, signals propagating through space, radio waves, infrared signals, etc. 
Thus, the "article of manufacture" may comprise the medium in which the code is 
embodied. Additionally, the "article of manufacture" may comprise a combination of 
hardware and software components in which the code is embodied, processed, and 

20 executed. Of course, those skilled in the art will recognize that many modifications may 
be made to this configuration without departing from the scope of the present invention, 
and that the article of manufacture may comprise any information bearing medium 
known in the art. 

[0057] The logic of FIGs. 2 4A, and 4B describes specific operations occurring in a 
25 particular order. In alternative implementations, certain of the logic operations may be 
performed in a different order, modified or removed. Moreover, operations may be added 
to the above described logic and still conform to the described implementations. Further, 
operations described herein may occur sequentially or certain operations may be 
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processed in parallel, or operations described as performed by a single process may be 
performed by distributed processes. 

[0058] The illustrated logic of FIGs. 2, 4A, and 4B may be implemented in software, 
hardware, programmable and non-programmable gate array logic or in some combination 

5 of hardware, software, or gate array logic. 

[0059] FIG. 5 illustrates an architecture 500 of a computer system that may be used in 
accordance with certain implementations of the invention. Client computer 100, server 
computer 120, and/or operator console 180 may implement computer architecture 500. 
The computer architecture 500 may implement a processor 502 (e.g., a microprocessor), 

10 a memory 504 (e.g., a volatile memory device), and storage 510 (e.g., a non-volatile 
storage area, such as magnetic disk drives, optical disk drives, a tape drive, etc.). An 
operating system 505 may execute in memory 504. The storage 510 may comprise an 
internal storage device or an attached or network accessible storage. Computer programs 
506 in storage 510 may be loaded into the memory 504 and executed by the processor 

15 502 in a manner known in the art. The architecture further includes a network card 508 
to enable communication with a network. An input device 5 12 is used to provide user 
input to the processor 502, and may include a keyboard, mouse, pen-stylus, microphone, 
touch sensitive display screen, or any other activation or input mechanism known in the 
art. An output device 514 is capable of rendering information from the processor 502, or 

20 other component, such as a display monitor, printer, storage, etc. The computer 

architecture 500 of the computer systems may include fewer components than illustrated, 
additional components not illustrated herein, or some combination of the components 
illustrated and additional components. 

[0060] The computer architecture 500 may comprise any computing device known in the 
25 art, such as a mainframe, server, personal computer, workstation, laptop, handheld 

computer, telephony device, network appliance, virtualization device, storage controller, 
etc. Any processor 502 and operating system 505 known in the art may be used. 
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[0061] The foregoing description of implementations of the invention has been presented 
for the purposes of illustration and description. It is not intended to be exhaustive or to 
limit the invention to the precise form disclosed. Many modifications and variations are 
possible in light of the above teaching. It is intended that the scope of the invention be 
5 limited not by this detailed description, but rather by the claims appended hereto. The 
above specification, examples and data provide a complete description of the 
manufacture and use of the composition of the invention. Since many implementations 
of the invention can be made without departing from the spirit and scope of the invention, 
the invention resides in the claims hereinafter appended. 



